(2) INFORMATION FOR SEQ ID NO;!;- 

(i) SEQDENCE CHARACTERISTICS: 

(A) LENGTH: 2277 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDBDNESS : both 
(D) TOPOLGGV: both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY I CDS 

(B) LOCATION: 16 $..1755 

(D) OTHER INFORMATION: /prcductw "ALPHA-i SUBUNIT" 

(Xi) SEQUENCB DESCRIPTION: SEQ ID N0:1: 

CAATGACCTG TTTTCTTCTG TAACCACAGO TTCGGTGCTG AGAGGAASCY TCGCAGAATC 60 

CAGCAGAATC CTCACAGAAT CCAGCAGCAG CTCTGCTGGG GACATGGTCC ATGGTGCAAC 120 

CCACAGCAAA GCCCTGACCT GACCTCCTGA TQCTCAGGAG AA3GGATGGG GCCGTGGTGT ISO 

CCTGTGTTCC TGTCCTTCAC AAAGCTCAGC CTGTGGTGGC TCCTTCTGAC CCCAGCAGGT 240 

GOAGAGGAAG CTAAGCGCCC ACCTCCCAGG GCTCCTGGAG ACCCACTCTC CTCTCCCAGT 300 

CCCACGGCAT TGCCGCAGGG AGGCTCGCAT ACCGAGACTG AGGACCGGCT CTTCAAACAC 360 

CTCTTCCGGG GCTACAACCG CTGGGCGCGC CCGGTGCCCA ACACTTCAGA CGTGGTGATT 420 

GTGCCCTTTG GACTGTCCAT CGCTCAGCTC ATCGATGTGC ATGAGAAGAA CCAAATGATG 480 

ACCACCAACG TCTGGCTAAA ACAGGAGTGQ AGCGACTACA AACTGCGCTG GAACCCCGCT 540 

GATTTTGGCA ACATCACATC TCTCAGGGTC CCTTCTGAGA TGATCTGGAT CCCCGACATT €00 

GTTCTCTACA ACAATGCAGA TGGGGAGTTT GCAGTGACCC ACATGACCAA GGCCCACCTC 660 

rrCTCCACGG GCACTGTGCA CTCGGTGCCC CCGGCCATCT ACAAGAGCTC CTGCAGCATC 720 

GACGTCACCT TCTTCCCCTT CGACCAGCAG AACTGCAAGA TGAAGTTTGG CTCCTGGACT 760 

TATGACAAGG CCAAGATCGA CCTGGAGCAG ATGOAGCAGA CTGTGGACCT GAAGGACTAC 840 

TGGQAGAGCG GCGAGTGGGC CATCGTCAAT GCCACGGGCA CCTACAACAG CAAQAAGTAC 900 

GACTGCTGCG CCGAGATCTA CCCCGACGTC ACCTACGCCT TCQTCATCCG GCGGCTGCCQ 960 

CTCTTCTACA CCATCAACCT CATCATCCCC TGCCTGCTCA TCTCCTGCCT CACTGTGCTG 1020 

GTCTTCTACC TGCCCTCCQA CTGCGGCGAG AAGATCACGC TGTGCATTTC GGTGCTQCTG 1080 

TCACTCACCG TCTTCCTGCT GCTCATCACT GAGATCATCC CGTCCACCTC GCTCGTCATC 1140 

CCGCTCATCC GCGAGTACCT GCTGTTCACC ATGATCTTCG TCACCCTGTC CATCGTCATC 1200 

ACCGTCTTCQ TGCTCAATGT GCACCACCGC TCCCCCAOCA CCCACACCAT GCCCCACTGG 1260 

GTOCGGGGGG CCCITCTGGG CTC3TGTGCCC CGGTOGCTTC TGATGAACCG GCCCCCACCA 1320 

CCCGTGGAQC TCTGCCACCC CCTACGCCTG AAGCTCAGCC CCTCTTATCA CTGGCTGGAO 1380 

AOCAACGTGG ATGCCQAGGA CAGGGAGGTG GTGGTGGAGG AGGAGGACAG ATGGGCATGT 1440 



GCAGGTCATC TGGCCCCCTC TGTGGGCACC CTCTGCAGCC ACGGCCACCT GCACTCTGGG 1500 

GCCTCAGGTC CCAAGGCTGA GGCTCTGCTG CAGGAGGGTG AGCTGCTGCT ATCACCCCAC 1560 

ATGCAGAAGG CACTGGAAGG TGTGCACTAC ATTGCCGACC ACCTGCGGTC TGAGGATGCT 1620 

GACTCTTCGG TGAAGGAGGA CTGGAAGTAT GTTGCCATGG TCATCGACAG GATCTTCCTC 1660 

TGGCTGTTTA TCATCOTCTG CTTCCTGGGG ACCATCGGCC TCTTTCTGCC TCCGTTCCTA 1740 

GCTGGAATGA TCTGACTQCA CCTCCCTCGA GCTGGCTCCC AGGGCAAAGG GGAGGGTTCT 160 0 

TQGATGTGGA AGGGCTTTGA ACAATGTTTA GATTTGOAGA TGAGCCCAAA GTCCCAGGGA 1860 

6AACAGCCAG GTGAGGTGGG AGGTTGGAGA GCCAGGTGAG GTCTCTCTAA GTCAGGCTGG 1920 

GGTTGAAGTT TGGAGTCTGT CCGAGTTTGC AGGGTGCTCA GCTGTATGGT CCAGCAGGGG 19 BO 

AGTAATAAGG GCTCTTCCGG AAGGGGAOGA AGCGGGAGGC AGGGCCTCCA CCTGATGTGG 2040 

AGGTACAGGG CAGATCTTCC CTACCGGGGA GGGATGGATG GTTGGATACA GGTGGCTGGO 2100 

CTATTCCATC CATCTGGAAG CACATTTGAG CCTCCAGGCT TCTCCTTQAC GTCATTCCTC 2160 

fGCTTCCTTG CTCCAAAATG GCTCTGCACC AQCCGGCCCC CAQGAGGTCT GQCAGAGCTQ 2220 

AGAGCCATGG CCTGCAGGGG CTCCATATGT CCCTACGCGT GCAGCAGGCA AACAAGA 2277 
12) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 1 529 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY ! unknown 

(ii) MOLECULE TVPfi; protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0t2: 

Met Gly Pro Ser Cys Fro Val Phe Leu Ser Phe Thr Lys Leu Ser Leu 
15 10 15 

Txp Trp Leu Leu Leu Thr Pro Ala Gly Gly Glu Glu Ala Lys Arg Pro 
20 25 30 

Pro Pro Arg Ala Pro Qly Asp Pro Leu Ser Ser Pro Ser Pro Thr Ala 
35 40 45 

Ijeu Pro Qln Gly Gly Ser His Thr Glu Thr Glu Asp Arg Leu Phe Lys 
50 55 60 

Eis Leu Phe Arg Gly Tyr Asn Arg Trp Ala Ar^ Pro Val Pro Asn Thr 
65 70 75 80 

Ser Asp Val Val lie Val Arg Phe Gly Leu Ser He Ala Gin Leu He 
65 90 95 

Aap Val Asp Glu Lys Asn Gin Met Met Thr Thr Aen Val Trp Leu Lys 
100 105 110 

Gin Glu Trp Ser Asp Tyr Lyo Leu Arg Trp Asn Pro hlsi Aap Phe Gly 
lis 120 125 

Asn He Thr Ser Leu Arg Val Pro Ser Glu Met He Trp He Pro Asp 
1^0 135 140 



lie Val Leu Tyr Asn Asn Ala Asp Gly Glu Phe Ala Val Thr His Met 
150 155 160 

Thr Lye Ala Hie Leu Phe Ser Thr Gly Thr Val His Trp Val Pro Pro 
165 170 175 

Ala lie Tyr Lys Ser Ser Cye Ser lie Asp Val Thr Phe Phe Pro Phe 
160 IBS 190 

Asp Gin Gin Asn Cys Lys Met Lys Phe Gly Ser Trp Thr Tyr Aap Lys 
155 200 205 

Ala Lys He Asp Leu Glu Gin Met Glu Gin Thr Val Asp Leu Lys Asp 
210 215 220 

Tyr Tip Glu Ser Gly Glu Trp Ala He Val Asn Ala Thr Gly Thr Tyr 
230 235 240 

Asn Ser Lye Lys Tyr Asp Cys Cys Ala Glu He Tyr Pro Asp Val Thr 
245 250 255 

Tyr Ala Phe Val He hrg Arg Leu Pro Leu Phe Tyr Thr He Asn Leu 
260 265 270 

He He Pro Cys Leu Leu He Ser Cys Leu Thr Val Leu Val Phe Tyr 
275 280 285 

Leu Pro Ser Asp Cye Gly Glu Lys He Thr Leu Cye He Ser Val Leu 
290 295 300 

Leu Ser Leu Thr Val Phe Leu Leu Leu He Thr Glu He He Pro Ser 
310 315 320 

Thr Ser Leu Val He Pro Leu He Gly Glu Tyr Leu Leu Phe Thr Met 
325 330 335 

He Phe Val Thr Leu Ser He Val He Thr Val Phe Val Leu Asn Val 
340 345 

His His Arg Ser Pro Ser Thr His Thr Met Pro His Trp Val Arg Gly 
355 360 365 

Ala Leu Leu Gly Cye Val Pro Arg Trp Leu Leu Met Asn Arg Pro Pro 
370 375 380 

Pro Pro Val Glu Leu Cys His Pro Leu Arg Leu Lys Leu Ser Pro Ser 
3B5 390 395 400 

Tyr His Trp Leu Glu Ser Asn Val Asp Ala Glu Glu Arg Glu Val Val 
405 410 415 

Val Glu Glu Glu Aap Arg Trp Ala Cys Ala Gly His Val Ala Pro Ser 
420 425 430 

Val Gly Thr Leu Cys Ser His Gly His Leu His Ser Gly Ala Ser Gly 
435 440 445 

Pro Lys Ala Glu Ala Leu Leu Gin Glu Gly Glu Leu Leu Leu Ser Pro 
450 455 460 

His Met Gin Lys Ala Leu Glu Gly Val His Tyr He Ala Asp His Leu 
465 470 475 480 

Arg Ser Glu Aep Ala Asp Ser Ser Val Lys Glu Asp Trp Lys Tyr Val 
4B5 490 495 



Ala Met Val He Asp Arg He Phe Leu Trp Leu Phe He He Val Cys 
500 505 510 

Phe Leu Gly Thr He Gly Leu Phe Leu Pro Pro Phe L u Ala Gly Met 
515 520 525 

He 

(2) INFORMATION FOR SEQ ID NO; 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) L2NGTHI 1757 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY; both 

lii) MOLECULE TYPE: cDNA 



(ix) FEATTOE; 

(A) NAME/KEY I CDS 

(B) LOCATIONi 39.. 1553 

(D) OTHER INFORMATION: /product^ "ALPHA-3 SUBUNIT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0i3i 



CCGACCGTCC 


GGGTCCGCGG 


CCAGCCCGGC 


CACCAGCCAT 


GGGCTCTGGC 


CCOCTCTCGC 


60 


TGCCCCTGGC 


GCTGTCGCCG 


CCGCGGCTGC 


TGCTGCTGCT 


GCTGTCTCTG 


CTGCCAGTGG 


120 


CCAGGGCCTC 


AGAGGCTGAG 


CACCGTCTAT 


TTGAGCGGCT 


GTTTGAAGAT 


TACAATGAGA 


IBG 


TCATCCGGCC 


TGTAGCCAAC 


GTGTCTGACC 


CAGTCATCAT 


CCATTTCCAG 


OTGTCCATGT 


240 


CTCAGCTGGT 


GAAGGTGGAT 


GAAGTAAACC 


AGATCATGGA 


GACCAACCTG 


TGGCTCAAGC 


300 


AAATCTGGAA 


TGACTACAAG 


CTGAAGTGGA 


ACCCCTCTGA 


CTATGGTGGG 


GCAGAGTTCA 


360 


TGCGTGTCCC 


TGCACAGAAG 


ATCTGGAAGC 


CAGACATTGT 


GCTGTATAAC 


AATGCTGTTG 


420 


GGGATTTCCA 


GGTGGACGAC 


AAGACCAAAG 


CCTTACTCAA 


GTACACTGGG 


GAGGTGACTT 


4S0 


OGATACCTCC 


GGCCATCTTT 


AAGAGCTCCT 


GTAAAATCQA 


CGTGACCTAC 


TTCCCGTTTG 


540 


ATTACCAAAA 


CTGTACCATG 


AAGTTCGGTT 


CCTGGTCCTA 


CGATAAGGCG 


AAAATCOATC 


600 


TGGTCCTGAT 


CGGCTCTTCC 


ATGAACCTCA 


AGGACTATTG 


GGAGAGCGGC 


GA6TGGGCCA 


660 


TCATCAAAGC 


CCCAGGCTAC 


AAACACGACA 


TCAAGTACAG 


CTGCTGCGAG 


GAGATCTACC 


720 


CCGACATCAC 


ATACTCGCTG 


WWCATCCGCC 


GGCTGTCGTT 


GTTCTACACC 


ATCAWCCTCA 


780 


TCATCCGCTQ 


GCTGATCATC 


TCCTTCATCA 


CTGTGGTCGT 


CTCCTACCTG 


CCCTCCGACT 


840 


GCGGCGAGAA 


GGTGACCCTQ 


TGYATTTCTG 


TCCXCCTCTC 


CCTGACGGTG 


TTTCTCCTGG 


900 


TGATCACTGA 


GACCATCCCT 


TCCACCTCOC 


TGGTCATCCC 


CCTGATTGGA 


GAGTACCTCC 


960 


TQWWCACCAT 


GATTTGTGTA 


ACCTTGTCCA 


TCGACATCAC 


CGTCTGCGTQ 


CTCAACGTGC 


1020 


ACTACAGAAC 


CCCGACGACA 


CACACAATGC 


CCrCATGGGT 


GAAGACTGTA 


TTCTTGAMCC 


1060 


TGCTCCCCAG 


GGTCATGTWC 


ATGACCAGGC 


CAACAAGCAA 


CGAGGGCAAC 


GCTCAGAAGC 


1140 


CGAGGCCCCT 


CTACGGTGCC 


GAGCTCTCAA 


ATCTGAATTG 


CTTCACCCGC 


GCAGAGTCCA 


1200 



AAGGCTGCAA GGAGGGCTAC CCXTTGCCAGG ACGGGATGTG TGGlTACTGC CACCACCGCA 1260 

GGATAAAAAT CTCCAATTTC AGTGCTAACC TCACGAGAAG CTCTAGTTCT GAATCTGTTG 1320 

ATC3CTGTGCT OTCCCTCTCT GCTTTGTCAC CAGAAATCAA AGAAGCCATC CAAAGTGTCA 13 60 

AGTATATTGC TCAAAATATG AAAGCACAAA ATGAAGCCAA AGAGATTCAA GATCATTGGA 1440 

AGTATGXTGC CATGGraATT GATCGTATIT TTCTGTCGGT TTTCACCCTC GTGTGCATTC 1500 

TAGGGACAGC AGGATTGTTT CTGCAACCCC TGATGGCCAG GGAAGATGCA TAAGCACTAA 1560 

GCTGTGTGCC TGCCTGGGAG ACTTCCTTGT GTCAGGGCAG GAGGAGGCTG CTTCCTAOTA 1620 

AGAACGTACT TTCTGTTATC AAQCTACCAG CTTTTyr Asp Lye Ala Lye lie Asp Leu Val Leu 
lie Gly 

180 



165 



190 



Ser Ser Met Asn Leu Lys Asp Tyr Trp Glu Ser Gly Glu Trp Ala He 
195 2O0 205 

lie Lys Ala Pro Gly Tyr Lys His Asp He Lys Tyr Ser Cys Cys Glu 
210 215 220 

Glu He Tyr Ps^o Asp He Thr Tyr Ser Leu Xaa He Arg Arg Leu Ser 
225 230 235 240 

Leu Phe Tyr Thr He Xaa Leu He He Arg Trp Leu He He Ser Phe 
245 250 255 

He Thr Val Val Val Ser Tyr Leu Pro Ser Asp Cys Gly Glu Lys Val 
260 265 270 

Thr Leu Cys He Ser Val Leu Leu Ser Leu Thr Val Phe Leu Leu Val 
275 2B0 285 

He Thr Glu Thr He Pro Ser Thr Ser Leu Val He Pro Leu He Gly 
290 295 300 

Glu Tyr Leu Leu Xaa Thr Met He Cys Val Thr Leu Ser He Asp He 
305 310 315 320 

Thr Val Cya Val Leu Asn Val Hie Tyr Axg Thr Pro Thr Thr His Thr 
325 330 335 

Met Pro Ser Trp Val Lye Thr Val Phe Leu Xaa Leu Leu Pro Arg Val 
340 345 350 

Met Xaa Met Thr Arg Pro Thr Ser Asn Glu Gly Asn Ala Gin Lys Pro 
355 360 365 

Arg Pro Leu Tyr Gly Ala Glu Leu Ser Asn Leu Asn Cye Phe Ser Arg 
370 375 380 

Ala Glu Ser Lys Gly Cys Lya Glu Gly Tyr Pro Cys Gin Asp Gly Met 
385 390 395 400 

Cys Gly Tyr Cys His Hie Arg Arg He Lys He Ser Aen Phe Ser Ala 
405 410 415 

Asn Leu Thr Arg Ser Ser Ser Ser Glu Ser Val Asp Ala Val Leu Ser 
420 425 430 

Leu Ser Ala Leu Ser Pro Glu He Lys Glu Ala He Gin Ser Val Lys 
435 440 445 



Tyr lie Ala Glu Aen Met l,ys Ala Gin Aan Glu Ala Lys Glu He Gin 
450 455 460 

Asp Asp Trp Lye Tyr Val Ala Met Val He Asp Arg He Phe Leu Tip 
465 470 475 400 

Val Phe Thr Leu Val Cys He Leu Gly Thr Ala Oly Leu Phe Leu Gin 
485 490 495 

Fro Leu Met Ala Arg Glu Asp Ala 
500 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTBRISTICS: 

(A) LENGTH: 2363 base pairs 

(B) TYPE I nucleic acid 

(C) STRANDEDNESS; both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



( ix) FEATURE I 

(A) NAME/KEY i CDS 

(B) LOCATION: 173.. 2056 

(D) OTHER INFORMATION; /products "ALPHA-4 SUBUNIT" 



<xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 



GCGCTCGCTG 


CGGCGCCGCC 


GCCGCNCCGC 


GCGCCACAGG AGAAGGCGAN 


CCGGGCCCGG 


60 


CGGCCGAAGC 


GGCCCGCGAG 


GCGCGGGAQG 


CATGAAGTTG GGCGCGCACG 


GGCCTCGAAG 


120 


CGGCGGGGAG 


CCCGGAGCCG 


CCCGCATCTA 


GAGCCCGCGA GGTGCGTGCG 


CCATGGAGCT 


180 


AGGGGGCCCC 


GGAGCGCCGC 


GGCTGCTGCC 


GCCGCTGCTG CTGCTTCTGG 


GGACCGGCCT 


240 


CCTGCGCGCC 


AGCAGCCATG 


TGGAGACCCG 


GGCCCACGCC GAGGAGCGGC 


TCCTGAAGAA 


300 


ACTCTTCTCC 


GGTTACAACA 


AGTGGTCCCG 


ACCCGTGGCC AACATCTCGG 


ACGTGGTCCT 


360 


CGTCCGCTTC 


GGCCTGTCCA 


TCGCXCAGCT 


CATTGACGTG GATGAGAAGA 


ACCAGATGAT 


420 


GACCACGAAC 


GTCTGGGTGA 


AGCAGGAGTG 


GCACGACTAC AAGCTGCGCT 


GGGACCCAGC 


4B0 


TGACTATGAC 


AATGTCACCT 


CCATCCGCAT 


CCCCTCCGAG CTCATCTGGC 


GGCCGGACAT 


540 


CGCCCTCTAC 


AACAATGCTG 


ACGGGGACTT 


CGCGGCCACC CACCTGACCA 


AGGCCCACCT 


600 


GTTCCATQAC 


GQQCGGGTGC 


AGCGGACTCC 


CCCGGCCATT TACAAGAGCT 


CCTGCAGCAT 


660 


CGACGTCACC 


TTCTTCCCCT 


TCGACCAQCA 


GAACTQCACC ATGAAATTCG 


GCTCCTGGAC 


720 


CTACGACAAG 


GCCAAGATCG 


ACCTGGTGAA 


CATGCACAGC CGCGTGGACC 


AGCTGGACTT 


780 


CTGGGAGAGT 


GGCGAGTGCC 


TCATCTCGGA 


CGCCGTGGGC ACCTACAACA 


CCAOGAAGTA 


640 


CGAOTGCTGC 


OCCGAGATCT 


ACCCGGACAT 


CACCTATQCC TACGCCATCC 


GGCGGCTGCC 


900 


CCTCTTCTAC 


ACCATCAACC 


TCATCATCCC 


CTGGCTGCTC ATCTCCTGCC 


TCACCGCGCT 


960 


CGTCTTCTAC 


CTGCCCTCCG 


AGTGTGGCGA 


GAAGATCACG CTGTGCATCT 


CCGTGCTGCT 


1020 


GTCGCTCACC 


GTCTTCCTGC 


TGCTCATCAC 


CGAGATCATC CCGTCCACCT 


CACTQGTCAT 


1080 



CCCACTCATC GGCC3AGTACC TGCTOTTCAC CATGATCTTC GTCACCCTGT CCATCGCCAT 1140 

CACQGTCTTC GTCCTCAACG TGCACCACCG CTCGCCACOC ACGCACACCA TOCCCACCTG 1200 

GGTACGCAGG GTCTTCCTGG ACATCGTGCC ACGCCTGCTC CTCATGAAGC GGCCGTCCGT 1260 

GGTCAAGGAC AATTGCCGGC GGCTCATCGA GTCCATGCAT AAGATGGCCA GTGCCCCGCG 1320 

CTTCTGGCCC GAGCCAGAAG GGGAGCCCCC TGCCACGAGC GGCACCCAGA GCCTGCACCC 1380 

TCCCTCACCG TCCTTCTGCG TCCCCCTGGA TGTGCCGGCT GAGCCTGGGC CTTCCTGCAA 1440 

GTCACCCTCC GACCAGCTCC CTCCTCAGCA GCCCCTGGAA GCTGAGAAAG CCAGCCCCCA 1500 

CCCCTCCCCT OGACCCTGCC GCCCGCCCCA CGGCACCCAG GCACCAGGGC TGGCCAAAGC 1560 

CAGGTCCCTC AGCQTCCAGC ACATGTCCAG CCCTGGCGAA GCGGTGGAAG GCGGCGTCCG 1620 

CTGCCGGTCT CGGAGCATCC AGTACTGTGT TCCCCGAGAC GATGCCGCCC CCGAGGCAQA 16 BO 

TCGCCAGGCT GCCGGCGCCC TGGCCTCTCG CAACAGCCAC TCGGCTGAGC TCCCACCCCC 1740 

AGACCAGCCC TCTCCGTGCA AATGCACATG CAAGAACGAG CCCTCTTCGG TGTCCCCGAG IB 00 

CGCCACGGTC AAGACCCGCA GCACCAAAGC GCCGCCGCCG CACCTGCCCC TGTCGCCGGC 1860 

CCXGAGCCGG GCGGTGGAGG GCGTCCAGTA CATTGCAGAC CACCTGAAGG CCGAAGACAC 1920 

AGACTTCTCG GTGAAGGAGG ACtGGAAGTA CGTGGCCATG GTCATCGACC GCATCTTCCT 1980 

CTGGATGTTC ATCATCGTCT GCCTGCTGGO GACGGTGGOC CTCTTCCTGC CGCCCTGGCT 2040 

GGCTGGCATC ATCTAGGAAG GGACCGGGAQ CCTGCGTGGC CTGGGGCTGC CGYGCACGGG 2100 

GCCAGCATCC ATGCGGCCGG CCTGGGGCCC GGCTGGCTTC TCCCTOGACX CTGTGGGGCC 21S0 

ACACGTTTGC CAAATTTTCC TTCCTGTTCT GTGTCTGCTG TAAGACGGCC TTGGACGGCG 2220 

ACACGGCCTC TGGGGAGACC GAGTGTGGAG CTGCTTCCAG TTGGACTCTS GCCTCACMAG 22 80 

GCAGCGGCTT GGAGCAGAGG TGGCGGTCGC CGCCTYCTAC CTGCAGGACT CGGGCTAAGT 2340 

CCAGCTCTCC CCCTGCGCAG CCC 2363 

(2) IITPORMATION FOR SEQ ID N0:6: 

(i) SEQITBIJCE CHARACTERISTICS: 

(A) LENGTH: 627 amino acidfl 
(Bl TYPE: amino acid 
(D) TOPOLOGY: uii3tnown 

(ii) MOLECULE TYPE; protein 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Glu Leu Gly Gly Pro Gly Ala Pro Arg Leu Leu Pro Pro Leu Leu 
15 10 15 

Leu Leu Leu Gly Thr Gly Leu Leu Arg Ala Ser Ser Hie Val Glu Thr 
20 25 30 

Arg Ala His Ala Glu Glu Arg Leu Leu Lys Lys Leu Phe Ser Gly Tyr 
35 40 45 

Asa Lys Trp Ser Arg Pro Val Ala Asn He Ser Asp Val Val Leu Val 
50 55 60 



Arg Phe Gly Leu Ser lie Ala- Gin I,eu lie Asp Val Asp Glu Lys Asn 
55 70 75 80 

Gin t^et Met Thr Thr Asn Val Trp Val Lys Gin Glu Trp His Asp Tyr 
85 so 95 

Lys Leu Arg Trp Asp Pro Ala Asp Tyr Glu Asn Val Thr Ser lie Arg 
100 105 110 

lie pro Ser Glu Leu lie Trp Arg Pro Aep He Ala Leu Tyr Asn Asn 
115 120 125 

Ala Asp Gly Asp Phe Ala Ala Thr His Leu Thr Lys Ala Hia Leu Phe 
130 135 140 

His Asp Gly Arg Val Gin Arg Thr Pro Pro Ala He Tyr Lye Ser Ser 
145 ISO ISS 160 

Cys Ser He Asp Val Thr Phe Phe Pro Phe Asp Gin Gin Asn Cys Thr 
165 170 175 

Met Lys Phe Gly Ser Trp Thr Tyr Asp Lys Ala Lys He Asp Leu Val 
180 IBS 190 

Asn Met His Ser Arg Val Asp Gin Leu Aep Phe Trp Glu Ser Gly Glu 
195 200 205 

Trp Leu He Ser Asp Ala Val Gly Thr Tyr Asn Thr Arg Lys Tyr Glu 
210 215 220 

Cys Cys Ala Glu He Tyr Pro Asp He Thr Tyr Ala Tyr Ala He Arg 
225 230 235 240 

Arg Leu Pro Leu Phe Tyr Thr He Asn Leu He He Pro Trp Leu Leu 
245 250 255 

He Ser Cys Leu Thr Ala Leu Val Phe Tyr Leu Pro Ser Glu Cys Gly 
260 2^5 270 

Glu Lys He Thr Leu Cys He Ser Val Leu Leu Ser Leu Thr Val Phe 
275 260 2S5 

Leu Leu Leu He Thr Glu He He Pro Ser Thr Ser Leu Val He Pro 
290 295 300 

Leu He Gly Glu Tyr Leu Leu Phe Thr Met He Phe Val Thr Leu Ser 
305 310 315 320 

He Ala He Thr Val Phe Val Leu Asn Val His His Arg Ser Pro Arg 
325 330 335 

Thr His Thr Met Pro Thr Trp Val Arg Arg Val Phe Leu Asp He Val 
340 345 350 

Pro Arg Leu Leu Leu Met Lys Arg Pro Ser Val Val Lys Asp Asn Cys 
355 3S0 365 

Arg Arg Leu He Glu Ser Met His Lys Met Ala Ser Ala Pro Arg Phe 
370 375 380 

Trp Pro Glu Pro Glu Gly Glu Pro Pro Ala Thr Ser Gly Thr Gin Ser 
365 390 395 400 



Leu His Pro Pro Ser Pro Ser Phe Cys Val Pro Leu Asp Val Pro Ala 
405 410 415 



Glu Pro Gly Pro Ser Cya Ly$ Ser Pro Ser Aep Gin Leu Pro Pro Gin 
420 425 

Gin Pro Leu Glu Ala Glu Lys Ala Ser Pro His Pro Ser Pro Gly Pr 
435 440 445 

Cys Arg Pro Pro His Gly Thr Gin Ala Pro Gly Leu Ala Lys Ala Arg 
450 455 460 

fier Leu Ser Val Gin Hie Met Ser Ser Pro Gly Glu Ala Val Glu Gly 
4^5 470 475 480 

Gly Val Arg Cys Arg Ser Arg Ser lie Gin Tyr Cye Val Pro Arg Asp 
465 490 495 

Asp Ala Ala Pro Glu Ala Asp Gly Gin Ala Ala Gly Ala Leu Ala Ser 
500 505 510 

Arg Asn Ser His Ser Ala Glu Leu Pro Pro Pro Aep Gin Pro Ser Pro 
SIS- 520 525 

Cye Lys Cya Thr Cys Lye Lys Glu Pro Ser Ser Val Ser Pro Ser Ala 
530 535 540 

Thr Val Lys Thr Arg Sex Thr Lys Ala Pro Pro Pro His Leu Pro Leu 
545 550 555 560 

Ser Pro Ala Leu Ser Arg Ala Val Glu Gly Val Gin Tyr lie Ala Asp 
565 570 575 

His Leu Lys Ala Glu Asp Thr Asp Phe Ser Val Lys Glu Aep Tip Lys 
Sao 5B5 590 

Tyr Val Ala Met Val lie Asp Arg He Phe Leu Trp Met Phe He He 
595 600 605 

Val Cys Leu Leu Gly Thr Val Gly Leu Phe Leu Pro Pro Trp Leu Ala 
610 615 620 

Gly Met He 
625 

(2) INFORMATION POR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 182 B base pairs 

(B) TYPE: nucleic acid 
iC) STKANDEDMESS : both 
(D) TOPOLOGYi both 

(ii) MOLECULE TYPE: CDMA 

(ix) FEATURE I 

<A) NAME/KEY: CDS 

(B) LOCATION: 155. .1561 

(D) OTHER INFORMATION: /product^ "ALPHA-5 SUBUNIT" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7i 

CCCGGCGGGA GCTGTGGCGC GGAGCGCCCC CGCTGCTGCG TCTGCCCTCG TTTTGTCTCA €0 

CGACTCACAC TCAGTGCTGC ACTCCCCAAG AGTTCGCGTT CCCCCCGCQQ CCGTCGAGAG 120 

OCGGCTGCCC GCGGTCCCGC GCGGQCGCGG GGCG ATG GCG 6CG CGC GOG TCA 172 

Met Ala Ala Arg Gly Ser 
1 5 



GGG CCC CGC GCG CTC CGC CTG CTG CTC TTG GTC CAG CTG GTC CCG GGG 220 
Gly Pro Arg Ala Leu Arg Leu I, u Leu Leu Val Gin Leu Val Ala Gly 
10 15 20 

CGC TGC GGT CTA GCG GGC GCG GCG CGC GGC GCG CAG AGA GOA TTA TCT 268 
Arg Cys Gly Leu Ala Gly Ala Ala Gly Gly Ala Gin Arg Gly Leu Ser 
25 30 35 

GAA CCT TCT TCT ATT GCA AAA CAT GAA GAT AGT TTG CTT AAG GAT TTA 31g 
Glu Pro Ser Ser He Ala Lys His Glu Asp Ser Leu Leu Ly« Asp Leu 
40 45 50 

TTT CAA GAC TAC GAA AGA TGO GTT CCT CCT GTG GAA CAC CTG AAT GAC 364 
Phe Gin Asp Tyx Glu Arg Trp Val Arg Pro Val Glu His Leu Asn Asp 
55 60 65 70 



AAA ATA AAA ATA AAA TTT GGA CTT GCA ATA TCT CAA TTG GTG GAT GTG 
Lye He Lys He Lys Phe Gly Leu Ala He Ser Gin Leu Val Asp Val 
75 80 85 



412 



GAT GAG AAA AAT CAG TTA ATG ACA AGA AAC GTC TGG TTG AAA CAG GAA 460 
Asp Glu Lys Asn Gin Leu Met Thr Thr Asn Val Trp Leu Lys Gin Glu 
90 95 100 

TGG ATA GAT GTA AAA TTA AGA TGG AAC CCT GAT CAC TAT GGT GGA ATA 508 
Trp He Asp Vsl Lye Leu Axg Trp Asn Pro Asp Asp Tyr Gly Gly He 
105 110 115 



AAA GTT ATA CGT GTT CCT TCA GAC TCT CTC TGG ACA CCA GAC ATC GTT 
Lys Val He Arg Val Pro Ser Asp Ser Val Trp Thr Pro Asp He Val 
120 125 130 



556 



TTG TTT GAT AAT GCA GAT GGA CGT TTT GAA GGG ACC AGT ACG AAA ACA 
Leu Phe Asp Asn Ala Asp Gly Arg Phe Glu Gly Thr Ser Thr Lys Thr 
135 140 145 ISO 



e04 



GTC ATC AGO TAC AAT GGC ACT GTC ACC TGG ACT CCA CCG GCA AAC TAC 
Val He Arg Tyr Asn Gly Thr Val Thr Trp Thr Pro Pro Ala Asn Tyr 
155 160 IBS 



652 



AAA AGT TCC TGT ACC ATA GAT GTC ACG TTT TTC CCA TTT GAC CTT CAG 
Lys Ser Ser Cys Thr He Asp Val Thr Phe Phe Pro Phe Asp Leu Gla 
170 175 160 



700 



AAC TGT TCC ATG AAA TTT GGT TCT TGG ACT TAT GAT GGA TCA CAG GIT 
Asn Cys Ser Met Lys Phe Gly Ser Trp Thr Tyr Asp Gly Ser Gin Val 
185 190 195 



746 



GAT ATA ATT CTA GAG GAC CAA GAT GTA GAC AAG AGA GAT TTT TTT OAT 
Asp He He Leu Glu Asp Oln Asp Val Asp Lys Arg Asp Phe Phe Asp 
200 205 210 



796 



AAT GGA GAA TGG GAG ATT GTG AGT GCA ACA GGG AGC AAA GGA AAC AGA 
Asn Gly Glu Trp Glu He Val Ser Ala Thr Gly Ser Lys Gly Asn Arg 
215 220 225 230 



644 



ACC GAC AGC TGT TGC TGG TAT CCG TAT GTC ACT TAC TCA TTT GTA ATC 892 
Thr Asp Ser Cys Cys Trp Tyr Pro Tyr Val Thr Tyr Ser Phe Val He 
235 240 245 

AAG CGC CTG CCT CTC TTT TAT ACC TTG TTC CTT ATA ATA CCC TGT ATT 940 
Lys Arg Leu Pro Leu Phe Tyr Thr Leu Phe Leu He He Pro Cys He 
250 255 260 



GOG CTC TCA TTT TTA ACT GTA CTT GTC TTC TAT CTT CCT TCA AAT GAA 
Gly Leu Ser Phe Leu Thr Val Leu Val Ph Tyr Leu Pro Ser Asn Glu 
265 270 275 



966 



GGT GAA AAG ATT TGT CTC TGC ACT TCA GTA CTX GTG TCT TTG ACT GTC 1036 
Giy Glu Lys lie Cys Leu Cys Thr Ser Val Leu Val Sex Leu Thr Val 
280 285 290 

TTC CTT CTG GTT ATT GAA GAG ATC ATA CCA TCA TCT TCA AAA GTC ATA 1084 
Phe Leu Leu Val lie Glu Glu He lie Pro Ser Ser Ser Lye Val He 
29S 300 305 310 

CCT CTA kTT GGA GAG TAT CTG GTA TTT ACC ATG ATT TTT GTG ACA CTG 1132 
Pro Leu He Gly Glu Tyr Leu Val phe Thr Met He Phe Val Thr Leu 
315 320 325 

TCA ATT ATG GTA ACC GTC TTC GCT ATC AAC ATT CAT CAT CGT TCT TCC HBO 
Ser lie Met Val Thr Val Phe Ala He Acn He Hie Hia Arg Ser Ser 
330 335 340 

TCA ACA CAT AAT CCC ATG GCG CCT TTG GTC CGC AAG ATA TTT CTT CAC 1228 
ser Thr His Asn Ala Met Ala Pro Leu Val Arg Lys He Phe Leu His 
345 350 355 

ACG CTT CCC AAA CTG CTT TGC ATG AGA AGT CAT GTA GAC AGG TAC TTC 12 76 

Thr Leu Pro Lye Leu Leu Cys Met Arg Ser His Val Asp Arg Tyr Phe 
360 365 370 

ACT CAQ AAA GAG GAA ACT GAG AGT GGT AGT GGA CCA AAA TCT TCT AGA 1324 
Thr Glr. Lys Glu Glu Hir Glu Ser Gly Ser Gly Pro Lys Ser ser Arg 
375 380 385 390 

AAC ACA TTG GAA GCT GCG CTC AAT TCT ATT CGC TAC ATT ACA AGA CAC 1372 
Asn Thr Leu Glu Ala Ala Leu Aan Ser He Arg Tyr He Thr Arg His 
395 400 405 

ATC ATG AAG GAA AAT GAT GTC CGT GAG GTT GTT GAA GAT TGG AAA TTC 1420 
He Met Lys Glu Asn Asp Val Arg Glu Val Val Glu Asp Trp Lys Phe 
410 415 420 

ATA GCC CAG GTT CTT GAT CGG ATG TTT CTG TGG ACT TTT CTT TTC GTT 1468 
He Ala Gin Val Leu Asp Arg Met Phe Leu Trp Thr Phe Leu Phe Val 
425 430 435 

TCA ATT GTT GGA TCT CTT GGG CTT TTT GTT CCT GTT ATT TAT AAA TGG 1516 
Ser He Val Gly Ser Leu Gly Leu Phe Val Pro Val He Tyr Lya Trp 
440 445 4S0 

GCA AAT ATA TTA ATA CCA GTT CAT ATT GGA AAT GCA AAT AAG TGAAGCCTCC 1568 
Ala Asn He Leu He Pro Val His He Gly Asn Ala Asn Lys 
455 460 465 

CAAGGGACTG AAGTATACAT TTAGTTAACA CACATATATC TGATGGCACC TATAAAATTA 1628 

TGAAAATGTA AGTTATGTGT TAAATTTAGT GCAAGCTTTA ACAGACTAAG TTGCTAACCT 1688 

CAATTTATGT TAACAGATGA TCCATTTGAA CAGTTGGCTG TAT5ACTGAA GTAATAACTG 1746 

ATGAGATACA TTTGATCTTG TAAAAATAGC AAAATATTAT CTGAACTGGA CTAGTGAAAA 1808 

ATCTAQTATT TGTATCCTGG 1628 

(2) INFORMATION FOR SEQ ID NOiB: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH I 468 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGV: linear 



(li) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTIOK: SEQ ID NOrB: 
Met Ala Ala Arg Gly Ser Gly Pro Arg Ala Leu Arg Leu Leu Leu L u 

Val Cln Leu Val Ala Gly Arg Cys Gly Leu Ala Gly Ala Ala Gly Gly 
20 25 30 

Ala Gin Arg Gly Leu Ser Glu Pro Ser Ser lie Ala Lye His Glu Asp 
35 40 45 

Ser Leu Leu Lys Aep Leu Phe Gin Asp Tyr Glu Arg Trp Val Arg Pro 
SO 55 50 

Val Glu His Leu Asn Asp Lys lie Lye lie Lys Phe Gly Leu Ala lie 
^5 70 75 80 

Ser Gin Leu Val Asp Val Asp Glu Lys Asn Gin Leu Met Thr Thr Aan 
85 $0 95 

Val Trp Leu Lys isin Glu Trp lie Asp Val Lye Leu Arg Trp Asn Pro 
100 105 110 

Asp Asp Tyr Gly Gly lie Lye Val He Arg Val Pro Ser Asp Ser Val 
115 120 125 

Trp Thr Pro Asp He Val Leu Phe Asp Asn Ala Asp Gly Arg Phe Glu 
130 135 140 

Gly Thr Ser Thr Lys Thr Val He Arg Tyr Asn Gly Thr Val Thr Trp 
150 155 160 

Thr Pro Pro Ala Asn Tyr Lys Ser Ser Cys Thr He Asp Val Thr Phe 
165 170 175 

Phe Pro Phe Asp Leu Gin Aan Cys Ser Met Lys Phe Gly Ser Trp Thr 
160 1B5 ISO 

Tyr Asp Gly Ser Gin Val Asp He He Leu Glu Asp Gin Asp Val Asp 
ISS 200 205 

Lys Arg Asp Phe Phe Asp Asn Gly Glu Trp Glu He Val Ser Ala Thr 
210 215 220 

Gly Ser Lys Gly Aen Arg Thr Asp Ser Cys Cys Trp Tyr Pro Tyr Val 
225 230 235 240 

Thr Tyr Ser Phe Val He Lye Arg Leu Pro Leu Phe Tyr Thr Leu Phe 
245 250 255 

Leu He He Pro Cys He Gly Leu Ser Phe Leu Thr Val Leu Val Phe 
260 265 270 

Tyr Leu Pro Ser Asn Glu Gly Glu Lys He Cys Leu Cys Thr Ser Val 
275 260 285 

Leu Val Ser Leu Thr Val Phe Leu Leu Val He Glu Glu He He Pro 
2S0 295 300 

Ser Ser Ser Lye Val He Pro Leu He Gly Glu Tyr Leu Val Phe Thr 
305 310 315 320 

Met He Phe Val Thr Leu Ser He Met Val Thr Val Phe Ala He Asn 
325 330 335 

He His Hie Arg Ser Ser Ser Thr His Asn Ala Met Ala Pro Leu Val 
340 . 345 350 



Arg Lys lie Phe Leu His Thr Leu Pro Lye Leu Leu Cye Met Arg S r 

"i^A 



360 



365 



His Val Asp Arg Tyr Phe Thr Gin Lys Glu Glu Thr Glu Ser Gly Ser 

375 330 

Gly Pro Lys Ser Ser Arg Asn Thr Leu Glu Ala Ala Leu Aan Ser He 



390 



395 



400 



Arg Tyr He Thr Arg His He Met Lys Glu Asn Asp Val Arg Glu Val 
405 410 415 

Val Glu Asp Trp Lys Phe He Ala Gin Val Leu Asp Arg Met Phe Leu 
420 425 430 

Trp Thr Phe Leu Phe Val Ser He Val Gly Ser Leii Gly Leu Phe Val 
435 440 445 

Pro Val He Tyx Lya Trp Ala Asn He Leu He Pro Val His He Gly 

a^n jitff 



450 

Asn Ala Asn Lys 
465 



455 



460 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS I 

(A) LENGTH: 1743 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : boCh 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDKA 

(ix) FEATURE: 

(A) NAME/XSY: CDS 

(B) LOCATION: 143.. 1627 

(D) OTHER INFORMATION: /product a "ALPHA- 6 SUBUNIT" 
(xi) SEQUENCE DESCRIPTIONi SEQ ID N0:9x 



CGGGTTTTGA TTTCTGAGAA GACACACACG GATTGCAGTG GGCTTCTGAT CATGTCAAGG 60 

TTGGATGCAT GTGGCTGACT GATAGCTCTT TGTTTTCCAC AATCCTTTGC CTAGGAAAAA 120 

GGAATCCAAG TGTGTTTTAA CC ATG CTG ACC AGC AAG GGQ CAG GGA TTC CTT 172 

Met Leu Thr Ser Lys Gly Gin Gly Phe Leu 
is 10 



StI ^ ™^ ^'^^ ^ ore ACA CCT TTC TTT AAA 220 

Hifl Gly Gly Leu Cye Leu Trp Leu Cye Val Phe Thr Pro Phe Phe Lys 
15 20 25 

r?5, JT? ^ ^ ^ CTC TTC CAC AAA CTG TTT 266 

Gly Cys Val Gly Cys Ala Thr Glu Glu Arg Leu Phe His Lya Leu Phe 
30 35 40 

TCT CAT TAC AAC CAG TTC ATC AGO CCT GTG GAA AAC GTT TCC GAC CCT 316 
Ser His Tyr Asn Gin Phe He Arg Pro Val Glu Asn Val Ser Asp Pro 
45 50 55 



364 



GAA GTA AAC CAG ATC ATG GAA ACC AAT TTG TGG CTG CGT CAC ATC TGG 4X2 
Glu Val Asn Gin He Met Glu Thr Aan Lsu Trp L u Arg His He Trp 
''S 80 85 90 

AAT GAT TAT AAA TTG CGC TGG OAT CCA ATG GAA TAT GAT GGC ATT GAG 460 
Asn Aep Tyr Lys Leu Arg Trp Asp Pro Met Glu Tyr Asp Gly He Glu 
95 100 105 

ACT CTT CGC GTT CCT GCA GAT AAG ATT TGG AAG CCC GAC ATT GTT CTC 508 
Thr Leu Arg Val Pro Ala Asp Lys He Trp Lys Pro Asp He Val Leu 
110 lis 120 

TAT AAC AAT GCT GTT GGT GAC TTC CAA GTA GAA GGC AAA ACA AAA GCT 556 
Tyr Asn Asn Ala Val Gly Asp Phe Gin Val Glu Gly Lys Thr Lys Ala 
125 130 135 

CTT CTT AAA TAC AAT GGC ATG ATA ACC TGG ACT CCA CCA GCT ATT TTT 604 
Leu Leu Lye Tyr Asn Gly Met He Thr Trp Thr Pro Pro Ala He Phe 
1^0 145 150 

AAG AGT TCC TGC CCT ATG GAT ATC ACC TTT TTC CCT TTT GAT CAT CAA 652 
Lys Ser Ser Cys Pro Met Aep He Thr Phe Phe Pro Phe Asp His Gin 
155 160 165 170 

AAC TGT TCC CTA AAA TTT GGT TCC TGG ACG TAT GAC AAA GCT GAA ATT 700 
Asn Cys Ser Leu Lys Phe Gly Ser Trp Thr Tyr Asp Lys Ala Glu He 
175 180 165 

GAT CTT CTA ATC ATT GGA TCA AAA GTG GAT ATG AAT GAT TTT TGG GAA 74 B 

Asp Leu Leu He He Gly Ser Lys Val Asp Met Asn Asp Phe Trp Glu 
190 195 200 

AAC AGT GAA TGG GAA ATC ATT GAT GCC TCT GGC TAC AAA CAT GAC ATC 796 
Asn Ser Glu Trp Glu He He Asp Ala Ser Gly Tyr Lys Hia Asp He 
205 210 215 

AAA TAC AAC TGT TGT GAA GAG ATA TAC ACA GAT ATA ACC TAT TCT TTC 844 
Lys Tyr Asn Cys Cys Glu Glu He Tyr Thr Asp He Thr Tyr Ser Phe 
220 225 230 

TAC ATT AGA AGA TTG CCG ATG TTT TAC ACG ATT AAT CTG ATC ATC CCT 892 
Tyr He Arg Arg Leu Pro Met Phe Tyr Thr He Asn Leu He He Pro 
235 240 245 250 

TGT CTC TTT ATT TCA TTT CTA ACC GTG TTG GTC TTT TAC CTT CCT TCG 940 
Cya Leu Phe He Ser Phe Leu Thr Val Leu Val Phe Tyr I*ea Pro Ser 
255 360 265 

GAC TGT GGT GAA AAA GTG ACG CTT TGT ATT TCA GTC CTG CTT TCT CTG 986 
Asp Cys Gly Glu Lya Val Thr Leu Cys He Ser Val Leu Leu Ser Leu 
270 275 280 

ACT GTG TTT TTG CTG GTC ATC ACA GAA ACC ATC CCA TCC ACA TCT CTQ 1036 
Thr Val Phe Leu Leu Val He Thr Glu Thr He Pro Ser Thr Ser Leu 
285 290 295 

GTG GTC CCA CTG GTG GGT GAG TAC CTG CTG TTC ACC ATG ATC TTT GTC 1084 
Val Val Pro Leu Val Gly Glu Tyr Leu Leu Phe Thr Met He Phe Val 
300 305 310 

ACA CTG TCC ATC GTG GTG ACT GTG TTT GTG TTG AAC ATA CAC TAC CGC 1132 
Thr Leu Ser He Val Val Thr Val Phe Val Leu Asn He His Tyr Arg 
315 320 325 330 

ACC CCA ACC ACG CAC ACA ATG CCC AGG TGG GTG AAC ACA GTT TTC CTG 1180 
Thr Pro Thr Thr His Thr Met Pro Arg Trp Val Lys Thr Val Phe Leu 
335 340 345 



AAG CTG CXG CCC CAG GTC CTG CTG ATG AGG TGG CCT CTG GAC AAG ACA 1228 
Lys Leu Leu Pro Gin Val Leu Leu Met Arg Trp Pro Leu Asp Lys Thr 
350 355 



^ ^ ^"^^ ^ CTT GCC AGG AGO CCT 1276 

Arg Gly xhr Gly Ser Aap Ala Val Pro Arg Gly Leu Ala Arg Arg Pro 
365 370 375 

^ ^ CCC AGA CAT CTT AAA GAA 1324 

Ala Lys Gly Lys Leu Ala Ser His Gly Glu Pro Arg His Leu Lye Glu 
. 380 385 390 

^^f 11^ ^ ^ *3CC ACA AGC AAG AGA AGA 1372 

Cys Phe His Cys His Lys Ser Asn Glu Leu Ala thr Ser Lye Arg Arg 
400 405 410 

TTA AGT CAT CAG CCA TTA CAG TGG GTG GTG GAA AAT TCG GAG CAC TCG 1420 
Leu Ser His Gin Pro Leu Oln Trp Val Val Glu Asn Ser Glu His Ser 
.415 420 425 

CCT GAA GTT GAA GAT GTG ATT AAC AGT GTT CAG TTC ATA GCA GAA AAC 1468 
Pro Glu Val Glu Asp Val He Asn Ser Val Gin Phe lie Ala Glu Asn 
430 435 440 

ATG A-'.G AC<! GAC AAT GAA ACC AAG GAG GTA GAA GAT GAC TGO AAA TAc ISlfe 
Met Lys Ser His Asn Glu Thr Lys Glu Val Glu Asp Asp Trp Lys Tyr 
445 450 455 

5™ ?^ ^^9 ^ TGG GTA TTT ATA ATT GTC 1564 

Val Ala Met Val Val Asp Arg Val Phe Leu Trp Val Phe He He Val 
460 465 ^ijQ 

TGT GTA TTT GGA ACT GCA GGG CTA TTT CTA CAG CCA CXA CTT GGG AAC 1612 
Cys Val Phe Gly Thr Ala Gly Leu Phe Leu Gin Pro Leu Leu Gly Asn 
480 485 450 

ACA GGA AAA TCT TAAAATGTAT TTTCTTTTAT GTTCAGAAAT TTACACACAC 1664 
Thr Gly Lye Ser 

CATATTTGTT CTGCATTCCC TGCCACAAGG AAAGGAAAGC AAAGGCTTCC CACCCAAGTC 1724 
CCCCATCTGC TAAAACCCG 



(2) INPORKATTON FOR SEQ ID NO: 10: 

(i) SEQUBHCE CHARACTBRISTICS : 

(A) LENGTH: 4 34 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY t linear 

(ii) MOLECULS TYPE: protein 

(Xi) SEQUENCE 0ESCRIPTIOK: SEQ ID NO; 10: 

Met Leu Thr Ser Lys Gly Gin Gly Phe Leu His Gly Gly Leu Cys Leu 
^ S 10 15 

Trp Leu Cys Val Phe Thr Pro Phe Phe Lys Gly Cys Val Gly Cys Ala 
20 25 30 

Thr Glu Glu Arg Leu Phe His Lys Leu Phe Ser His Tyr Asn Gin Phe 
35 40 45 

He Arg Pro Val Glu Asn Val Ser Asp Pro Val Thr Val His Ph Glu 
50 55 60 



1743 



Val Ala He Thr Gin Leu Ala Asn Val Asp Glu Vsl Asn Gin He Met 
65 70 75 80 



Qlu Thr Asn Leu Trp Leu Arg His II Trp Asn Asp Tyr Lys Leu Arg 
B5 90 95 

Trp Asp Pro Met Glu Tyr Asp Gly He Glu Thr Leu Arg Val Pro Ala 
100 105 110 

Asp Lys He Trp Lys Pro Asp He Val Leu Tyr Aen Aen Ala Val Gly 
115 120 125 

Asp Phe Gin Val Glu Gly Lye Thr Lys Ala Leu Leu Lys Tyr Asn Gly 
130 135 140 

Met He Thr Trp Thr Pro Pro Ala He Phe Lye Ser Ser Cys Pro Met 
145 150 155 160 

Asp He Thr Phe Phe Pro Phe Asp His Gin Asn Cys Ser Leu Lys Phe 
165 170 175 

Gly Ser Trp Thr Tyr Asp Lys Ala Glu He Asp Leu Leu He He Gly 
180 185 190 

Ser Lye Val Asp Met Aan Asp Phe Trp Glu Asn Ser Glu Trp Glu He 
195 200 205 

He Asp Ala Ser Gly Tyr Lys Kis Asp He Lys Tyr Asn Cys Cys Glu 
210 215 220 

Glu He Tyr Thr Asp He Thr Tyr Sex Phe Tyr He Arg Arg Leu Pro 
225 230 235 240 

Met Phe Tyr Thr He Asn Leu He He Pro Cys Leu Phe He Ser Phe 
245 250 2S5 

Leu Thr Val Leu Val Phe Tyr Leu Pre Ser Asp Cys Gly Glu Lys Val 
260 265 270 

Thr Leu Cys He Ser Val Leu Leu Ser Leu Thr Val Phe Leu Leu Val 
275 280 285 

He Thr Glu Thr He Pro Ser Thr Ser Leu Val Val Pro Leu Val Gly 
290 295 300 

Glu Tyr Leu Leu Phe Thr Met He Phe Val Thr Leu Ser He Val Val 
305 310 315 320 

Thr Val Phe Val Leu Aen He His Tyr Arg Thr Pro Thr Thr His Thr 
325 330 335 

Met Pro Arg Trp Val Lys Thr Val Phe Leu Lys Leu Leu Pro Gin Val 
340 345 350 

Leu Leu Met Arg Trp Pro Leu Asp Lys Thr Arg Gly Thr Gly Ser Asp 
355 360 365 

Ala Val Pro Arg Gly Leu Ala Arg Arg Pro Ala Lys Gly Lys Leu Ala 
370 37S 380 

Ser His Gly Glu Pro Arg His Leu Lys Glu Cys Phe Bis Cys His Lys 
385 390 395 400 

Ser Asn Glu Leu Ala Thr Ser Lys Arg Arg Leu Ser His Gin Pro Leu 
405 410 415 



Gin 



Trp Val Val Glu Asn Ser Clu His Ser 
420 425 



Pro Glu Val 



Glu Asp Val 
430 



He 



Aan Ser Val Gin Phe He Ala Glu Adh 
435 440 



Met Lys Ser 
445 



His Asn Glu 



Thr 



Lys Glu Val Glu Aj?p Aep Trp Lys Tyr 
450 455 



Val Ala Met 
460 



Val Val Asp 



Arg 
4£5 



Val Phe Leu Trp Val Phe He He Val 
470 



Cys Val Phe 
475 



Gly Thr Ala 
4Q0 



Gly 



Leu Phe Leu Gin Pro Leu Leu Gly Asn 
4S5 490 



Thr Gly Lyc 



C2) INFORMATION FOR SEO ID N0;11; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1676 base pairs 

(B) TVPB: nucleic acid 

(C) STRANDEDNESS I both 

(D) TOPOLOGY I both 

(ii) MOLECULE TYPEi cDNA 

(ixt FEATURE: 

(A) NAME/XEY: CDS 

(B) LOCATION: 73.. 1581 

(D) OTHER INFORMATION: /products "ALPHA- 7 SUBUNIT" 



(xi) SEQUENCE DESCRIPTION I SSQ ID NO ill; 



GGCCGCAGGC 


CCAGGCCCGG 


GCGACAGCCG 


AGACGTGGAG COCGCCGGCT 


CGCTGCAQCT 


€0 


CCGGGACTCA 


ACATGCGCTG 


CTCGCCGGGA 


CGCCTCTGGC TGGCGCTGGC 


CGCGTCGCTC 


120 


CTGCACGTGT 


CCCTGCAAGG 


CGAGTTCCAG 


AGGAAGCTTT ACAAGGAGCT 


GGTCAAGAAC 


180 


TACAATCCCT 


TQGAGAGGCC 


CGTGGCCAAT 


GACTCGCAAC CACTCACCGT 


CTACTTCTCC 


240 


CTGAGCCTCC 


TGCACSATCAT 


GGACGTGGAT 


GAGAAGAACC AAGTTTTAAC 


CACCAACATT 


300 


TGGCTQCAAA 


TGTCTTGGAC 


AGATCACTAT 


TTACAGTGGA ATGTGTCAGA 


ATATCCAGGG 


360 


GTGAAGACTG 


TTCGTTTCCC 


AGATGGCCAG 


ATTTGGAAAC CAGACATTCT 


TCTCTATAAC 


420 


AQTGCTGATQ 


AGCOCTTTGA 


CGCCACATTC 


CACACTAACG TGTTGGTGAA 


TTCTTCTGGG 


460 


CATTGCCAGT 


ACCTGCCTCC 


AQGCATATTC 


AAQAQTTCCT GCTACATCGA 


TGTACGCTGG 


540 


TTTCCCTTTG 


ATGTGCAGCA 


CTGCAAACTG 


AAGTTTGGGT CCTGGTCTTA 


CGGAGGCTGG 


600 


TCCTTGGATC 


TGCAGATGCA 


GGAGGCAGAT 


ATCAGTGGCT ATATCCCCAA 


TGGAGAATGG 


660 


GACCTAGTGG 


GAATCCCCGG 


CAAGAGGAGT 


GAi^GGTTCT ATGAGTGCTG 


CAAAGAGCCC 


720 


XACCCCGATO 


TCACCTTCAC 


AGTGACCATQ 


CGCCGCAGGA CGCTCTACTA 


XGGCCTCAAC 


780 


CTGCTGATCC 


CCTGTGTGCT 


CATCTCCGCC 


CTCGCCCTGC TGGTGTTCCT 


GCTTCCTGCA 


640 


GATTCCGOGQ 


AQAAQATTTC 


CCTGGGGATA 


ACAGTCTTAC TCTCTCTTAC 


CGTCTTCATG 


900 


CTGCTCGTGG 


CTGAGATCAT 


GCCCGCAACA 


TCCGATTCGG TACCATTGAT 


AGCCCAGTAC 


960 


TTCOCCAGCA 


CCATGATCAT 


CGTGGCCCTC 


TCGGTGGTGG TGACGGTGAT 


CGTGCTGCAG 


1020 



• 





AUGACCCuGA 


CGQGGGCAAG 


ATGCCCAAGX GGALCAwttjl 




^ V 0 u 


iiAU 1 WW I V 




ouOAAiGAAb 




p cnr^fYifirf 

U w iwUrVj w 


1140 




AL#UAI^C9(^W 


CTGCAGCCTG 


GCCAGTGTGG AWiiWU»UUt. 




1 9 n ft 


CCGCCCGCCA 


GCAACGGGAA 


CCTGCTGTAC 


ATCGGCTTCC GCGGCCTGGA 


PATtPPTV^PllP 


1 « 0 V 




CCCCCGACTC 


TGGGGTAGTG 


7GTGGCCGCA TGGwwlXiviw 


V V i.-n.LtL9Li/iw< 


1 1 5 n 

X J3 ^ U 


QATGAGCACC 


TCCTGCACGG 


CGGGCAACCC 


CCCGAGGGGG ACCCGGACTT 


ft/^PPL ^(^^TP 




CTGGAGGAGG 


TCCGCTACAT 


TGCCAATCGC 


TTCCGCTGCC AGGACCaAAAb 


LuAuw^uva X ^ 


1 ft 


TGCAGCGAGT 


ggaagttcgc 


CGCCTGXGTG 


GTGGACCGCC TGXGCCTCAT 


GGCCTTCTCG 




GTCTTCACCA 


TCATCTGCAC 


CATCGGCATC 


CTGATGTCGG CTCCCAACTT 


CGTGGAGGCC 




GXGTCCAAAQ 


AC J- 1 CGTA 


ACCACGCCTG 


GTTCTGTACA TGTGGAAAAC 


TCACAGATGG 


1520 


GCAAGGCCTT 


TGGCTTGGCG 


AGATTTGGGG 


GTGCTAATCC AGGACAGCAT 


TACACGCCAC 


1680 


AACTCCAGTG 


TTCCCTTCTG 


GCTCTCAGTC 


GTGTTGCTTA CGGTTTCTTT 


GTTACTTTAG 


1740 


GTAGTAGAAT 


CTCAGCACTT 


TGTTTCATAT 


tCTCAGATGG GCTGATAGAT ATCCTTGGCA 


1100 


CATCCGTACC 


ATCGGTCAGC 


AGGGCCACTG 


A3TAGTCATT TTGCCCATTA 


GCCCACTGCC 


1860 


TGGAAAGCCC 


TTCGGA 








1876 


(2) INFORMATION FOR SEQ ID NO; 12 









(i) SEQUENCE CHARACTERISTICS: 

tA) LENGTH: 502 amino acids 
(B) TYPE: amino acid 
CD) TOPOZ-OGY: un3mown 

(ii) MOLECULE TYPE: protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID N0:12» 

Met Arg Cys S&t Pro Gly Gly Val Trp Leu Ala Leu Ala Ala Ser Leu 
15 10 15 

Leu His VaZ Ser Leu Gin Gly Glu Phe Gin Arg Lys Leu Tyr Lys Glu 
20 25 30 

Leu Val Lys Aan Tyr Asn Pro Leu Glu Arg Pro Val Ala Asn Aep Ser 
35 40 45 

Gin Pro Leu Thr Val Tyr Phe Ser Leu Ser Leu Leu Gin He Met Asp 
50 55 60 

Val Asp Glu Lva Asn Gin Val Leu Thr Thr Asn lie Trp Leu Gin Met 
65 70 75 60 

Ser Trp Thr Asp Hie Tyr Leu Gin Trp Asn Val Ser Glu Tyr Pro Gly 
85 90 95 

Val Lys Thr Val Arg Phe Pro Aap Gly Gin lie Trp Lys Pro Aap lie 
100 105 110 

Leu Leu Tyr Aen Ser Ala Asp Glu Arg Phe Asp Ala Tlir Phe His Thr 
lis 120 125 

Asn Val Leu Val Asn Ser Ser Gly His Cys Gin Tyr Le?u Pro Pro Gly 
130 135 140 



He Phe Lye Ser Ser Cys Tyr He Asp Val Arg Trp Phe Pro Phe ABp 
145 150 155 160 

V I Gin HiB Cys hys Leu Lye Phe Gly Ser Trp Ser Tyr Gly Gly Trp 
165 170 175 

Ser Leu Asp Le\i Gin Met Gin Glu Ala Asp He Ser Gly Tyr He Pro 
180 185 190 

Aen Gly Glu Trp Asp Leu Val Gly He Pro Gly Lye Arg Ser Glu Arg 
195 200 205 

Phe Tyr Glu Cys Cys Lys Glu Pro Tyr Pro Asp Val Thr Phe Thr Val 
210 215 220 

Thr Met Arg Arg Arg Thr Leu Tyr Tyr Gly Leu Aen Leu Leu He Pro 
225 230 235 240 

Cys Val Leu He Ser Ala Leu Ala Leu Leu Val Phe Leu Leu Pro Ala 
245 250 255 

Asp Ser Gly Glu Lys He Ser Leu Gly He Thr Val Leu Leu Ser Leu 
260 26S 270 

Thr Val Phe Met Leu Leu Val Ala Glu He Met Pro Ala Thr Ser Asp 
275 280 285 

ser Val Pro Leu He Ala Gin Tyr Phe Ala Ser Thr Met He He Val 
2S0 295 300 

Gly Leu Ser Val Val Val Thr Val He Val Leu Gin Tyr His His Hie 
305 310 315 320 

Asp Pro Asp Gly Gly Lys Met Pro Lya Trp Thr Arg Val He Leu Leu 
325 330 335 

Asn Trp Cys Ala Trp Phe Leu Axg Met Lys Arg Pro Gly Glu Asp Lys 
340 345 350 

Val Arg Pro Ala Cys Gin His Lys Gin Arg Arg Cys Ser Leu Ala Ser 
355 360 365 

Val Glu Met Ser Ala Val Ala Pro Pro Pro Ala Ser Asa Gly Asn Leu 
370 375 380 

Leu Tyr He Gly Phe Arg Gly Leu Asp Gly Val His Cys Val Pro Thr 
385 350 395 400 

Pro Asp Ser Gly val Val Cys Gly Arg Met Ala Cye Ser Pro Thr His 
405 410 415 

Abp Glu His Leu Leu His Gly Gly Gin Pro Pro Glu Gly Asp Pro Asp 
420 . 425 430 

Leu Ala Lys He Leu Glu Glu Val Arg Tyr He Ala Asn Arg Phe Arg 
435 440 445 

Cys Gin Asp Glu Ser Glu Ala Val Cye Ser Glu Trp Lys Phe Ala Ala 
450 455 460 

Cys Val Val Asp Arg Lsu Cys Leu Met Ala Phe Ser Val Phe Thr He 
465 470 475 480 



He Cys Thr He Gly He Leu Met Ser Ala Pro Asn Phe Val Glu Ala 
485 490 495 



• 



Val Ser Lys Asp Phe Ala 
SOO 

(2) INFORMATION FOR SEQ ID N0il3i 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 2448 base pairB 

(B) TYPE: nucleic acid 

(C) STRANBEDNESS: both 

(D) TOPOLOGY; both 

(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAr4E/KEY: CDS 

(B) LOCATION: 265. .1773 

(D) OTHER INFORMATION: /product= "BETA-2 SUBUNIT** 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CTCCTCCCCC TCACCGTCCC AATTGTATTC CCTGGAAGAG CAGCCGGAAA AGCCTCCGCC 60 

TGCTCATACC AGGATAGGCA AGAAGCTCGT TTCTCCTCGC AGCCOGCTCC CTGAGGCCCA 120 

GGAACCACCG CGGCGGCCGG CACCACCTGG ACCCAGCTCC AGGCGGGCGC GGCTTCAGCA 180 

CCACGGACAG CGCCCCACCC GCGGCCCTCC CCCCGGCGGC GCGCTCCAGC CGGTGTAGGC 240 

GAGGCAGCGA GCTATGCCCG CGGC ATG GCC CGG CGC TGC GGC CCC GTG GCG 291 

Met Ala Arg Arg Cys Qly Pro Val Ala 
1 5 

CTG CTC CTT GGC TTC GGC CTC CTC CGG CTG TGC TCA GGG GTG TGG GOT 339 
Leu Leu Leu Gly Phe Gly Leu Leu Arg Leu Cys Ser Gly Val Trp Gly 
10 15 20 25 

ACG GAT ACA GAG GAG CGG CTG GTG GAG CAT CTC CTG GAT CCT TCC CGC 387 
Thr Asp Thr Glu Glu Arg Leu Val Glu His Leu Leu Asp Pro Ser Arg 
30 35 40 

TAC AAC AAG CTT ATC CGC CCA GCC ACC AAT GGC TCT GAG CTG GTG ACA 435 
Tyr Asn Lys Leu lie Arg Pro Ala Thr Asn Gly Ser Glu Leu Val Thr 
45 50 55 

GTA CAG CTT ATG QTG TCA CTQ GCC CAG CTC ATC AGT GTG CAT GAG CGG 483 
Val Gin Leu Met Val Ser Lea Ala Gin Leu lie Ser Val Hie Glu Arg 
60 65 70 

GAG CAG ATC ATG ACC ACC AAT GTC TGG CTG ACC CAG GAG TGG GAA GAT 531 
Glu Gin lie Met Thr Thr Aen Val Trp Leu Thr Gin Glu Trp Glu Asp 
75 80 85 

TAT CGC CTC ACC TGG AAG CCT GAA GAG TTT GAC AAC ATG AAG AAA GTT 579 
Tyr Arg Leu Thr Trp Lys Pro Glu Glu Phe Asp Asn Met Lys Lys Val 
90 95 100 105 

CGG CTC CCT TCC AAA CAC ATC TGG CTC CCA GAT GTG GTC CTG TAC AAC 627 
Arg Leu Pro Ser Lys His He Trp Leu Pro Aep Val Val Leu Tyr Aen 
110 X15 120 

AAT OCT GAC GGC ATG TAC GAG GTG TCC TTC TAT TCC AAT GCC GTG GTC 675 
Asn Ala Asp Gly Met Tyr Glu Val Ser Phe Tyr Ser Asn Ala Val Val 
125 130 135 

TCC TAT GAT GGC AGC ATC TTC TGG CTG CCG CCT GCC ATC TAC AAG AGC 723 
Ser Tyr Asp Gly Ser He Phe Trp Leu Pro Pro Ala He Tyr Lys Ser 
140 145 150 



GCA TGC AAG ATT Ohh GTA AAG CAC TTC CCA TTT QAC GAG CAG AAC TGC 771 
Ala Cys Lye lie Glu Val Lye His Phe Pro Phe Asp Gin Gin Asn Cys 
155 160 165 

ACC ATG AAG TTC CGT ICG TGG ACC TAC GAC CGC ACA GAG ATC GAC TTG 81$ 
Thr Met Lys Phe Arg Ser Trp Thr Tyr Asp Arg Thr Glu lie Asp Leu 
1.70 175 leo 185 

GTG CTG AAQ AGT GAG GTG GCC AGC CTG GAC GAC TTC ACA CCT AGT GGT 867 
Val Leu Lys Ser Glu Val Ala Ser Leu Asp Asp Phe Thr Pro Ser Gly 
190 195 200 

GAG TGG GAC ATC GTG GCG CTG CCG GGC CGG CGC AAC GAG AAC CCC GAC 915 
Glu Trp Asp lie Val Ala Leu Pro Gly Arg Arg Asn Glu Asn Pro Asp 
205 210 215 

QAC TCT ACG TAC GTG GAC ATC ACG TAT GAC TTC ATC ATT CGC CGC AAG 963 
ABp Ser Thr Tyr Val Asp lie Thr Tyr Asp Phe lie lie Arg Arg Lys 
220 225 230 

CCG CTC TTC TAC ACC ATC AAC CTC ATC ATC CCC TGT GTG CTC ATC ACC 1011 
Pro Leu Phe Tyr Thr He Aen Leu He He Pro Cys Val Leu He Thr 
235 240 245 

TCG CTA GCC ATC CTT GTC TTC TAC CTG CCA TCC GAC TGT GGC GAG AAG 1059 
Ser Leu Ala He Leu Val Phe Tyr Leu Pro Ser Asp Cys Gly Glu Lys 
250 255 260 265 

ATG ACG TTG TGC ATC TCA GTG CTG CTG GCG CTC ACG CTC TTC CTG CTG 1107 
Met Thr Leu Cye He Ser Val Leu Leu Ala Leu Thr Val Phe Leu Leu 
270 275 2B0 

CTC ATC TCC AAG ATC GTG CCT CCC ACC TCC CTC GAC GTG CCG CTC GTC 1155 
Leu He Ser Lys He Val Pro Pro Thr Ser Leu Asp Val Pro Leu Val 
285 290 295 

GGC AAG TAC CTC ATG TTC ACC ATG GTG CTT GTC ACC TTC TCC ATC GTC 1203 
Gly Lys Tyr Leu Met Phe Thr Met Val Leu Val Thr Phe Ser He Val 
300 305 310 

ACC AGC GTG TGC GTG CTC AAC GTG CAC CAC CGC TCG CCC ACC ACG CAC 1251 
Thr Ser Val Cy3 Val Leu Asn Val Hie His Arg Ser Pro Thr Thr His 
315 320 325 

ACC ATG GCG CCC TGG GTG AAG GTC GTC TTC CTG GAG AAG CTG CCC GCG 1299 
Thr Met Ala Pro Trp Val Lys Val Val Phe Leu Glu Lys Leu Pro Ala 
330 335 340 345 

CTG CTC TTC ATG CAG CAG CCA CGC CAT CAT TGC GCC CGT CAG CGC CTG 1347 
Leu Leu Phe Met Gin Gin Pro Arg His His Cya Ala Arg Gin Axg Leu 
350 355 360 

CGC CTG CGG CGA CGC CAG CGT GAG CGC GAG GGC CCT GGA GCC CTC TTC 1395 
Arg Leu Arg Arg Arg Gin Arg Glu Arg Glu Gly Ala Gly Ala Leu Phe 
365 370 375 

rrC CGC GAA GCC CCA GGG GCC GAC TCC TGC ACG TGC TTC GTC AAC CGC 1443 
Phe Arg Glu Ala Pro Gly Ala Asp Ser Cya Thr Cya Phe Val Asn Arg 
380 3B5 390 

GCG TCG GTG CAG CCG TTG GCC GOG GCC TTC GCG CCT GAG CCT GCA CCA 1491 
Ala Ser Val Gin Gly Leu Ala Gly Ala Phe Gly Ala Glu Pro Ala Pro 
395 400 405 

GTTG GCG GGC CCC GGG CGC TCA GGG GAG CCG TGT GGC TGT GGC CTC CGG 1539 
Val Ala Gly Pro Gly Arg Ser Gly Glu Pro Cys Gly Cya Gly Leu Arg 
410 415 420 425 



• 



GAG GCG GTG GAC GGC GTG CCC TTC ATC GCA GAC CAC ATG CGG AGC GAG 1567 
Glu Ala Val Asp Glv Val Arg Phe He Ala Asp His Met Arg Ser Glu 
430 435 440 

GAC GAT GAC GAG AGC GTG AGT GAG GAC TGG AAG TAC GTC GCC ATG GTG 1635 
Asp Asp Aep Gin Ser Val Ser Glu Asp Trp Lys Tyr Val Ala Met Val 
445 450 455 

ATC GAC CGC CTC TTC CTC TGG ATC TTT GTC TTT GTC TGT GTC TTT GGC 1663 
He Asp Arg Leu Phe Leu Trp lie Phe Val Phe Val Cys Val Phe Gly 
460 465 470 

ACC ATC GGC ATG TTC CTG CAG CCT CTC TTC CAG AAC TAC ACC ACC ACC 1731 
Thr He Gly Met Phe Leu Gin Pro Leu Phe Gin Asn Tyr Thr Thr Thr 
475 460 4B5 

ACC TTC CTC CAC TCA GAC CAC TCA GCC CCC AGC TCC AAG TGAGGCCCTT 1780 
Thr Phe Leu Hie Ser Asp His Ser Ala Pro Ser Ser Lys 



490 


' 495 500 




CCTCATCTCC 


ATGCTCTTTC ACCCTGCCAC CCTCTCCTGC ACAGTAGTGT TGGGTGGAGG 


1640 


ATGGACGAGT 


GAGCTACCAG GAAGAGGGGC GCTGCCCCCA CAGATCCATC CTTTTGCTTC 


1900 


ATCTGGAGTC 


cctcctcccc cacgcctcca tccacacaca gcagctccaa cctggaggct 


1^60 


GGACCAACTG 


CTTTGTTTTG GCTGCTCTCC ATCTCTTGTA CCAGCCCAGO CAATAGTGTT 


2020 


GAGGAGGGGA 


GCAAGGCTGC TAAGTGGAAG ACAGAGATGG CAGAQCCATC CACCCTGAGG 


2080 


AGTGACGGGC 


AAGGGGCCAG GAAGGGGACA GGATTGTCTG CTGCCTCCAA GTCATGGGAG 


2140 


AAGAGGGGTA 


TAGGACAAGG QOTGGAAGGG CAGGAGCTCA CACCGCACCG GGCTGGCCTG 


2200 


ACACAATGGT 


AGCTCTGAAG GGAGGGGAAG AGAGAGGCCT GGGTGTGACC TGACACCTGC 


2260 


CGCT6CTTGA 


GTGGACAGCA GCTGGACTGG GTGGGCCCCA CAGTGGTCAG CGATTCCTGC 


2320 


CAAGTAGGGT 


TTAGCCGGGC CCCATGGTCA CAGACCCCTG GGGGAGGCTT CCAGCTCAGT 


2380 


CCCACAGCCC 


CTTGCTTCTA AGGGATCCAG AGACCTGCTC CAGATCCTCT TTCCCCACTG 


2440 


AAGAATTC 




244B 


(2) INPORMATICK FOR SEQ ID N0il4i 




(i) 


SEQU3NCE CHARACTERISTICS: 

(A) LENGTH 1 502 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 




(ii) 


MOLECULE TYPE: protein 




(xi) 


SEQUSNCE DESCRIPTION: SEQ ID K0:14: 





Met Ala Arg Arg Cys Gly Pro Val Ala Leu Leu Leu Gly Phe Gly Leu 
15 10 15 



Leu Arg Leu Cys Ser Gly Val Trp Gly Thr Asp Thr Glu Glu Arg Leu 
20 25 30 

Val Clu Bis Leu Leu Asp Pro Ser Arg Tyr Asn Lya Leu lie Arg Pro 
35 40 4S 

Ala Thr Asn Gly Ser Glu Leu Val Thr Val Gin Leu Met Val Ser Leu 
50 55 60 



Ala Gin L u lie Ser Val His Glu Arg Glu Qln lie Met Thr Thr Asn 
fiS 70 75 80 

Val Trp Leu Thr Gin Glu Trp Glu Asp Tyr Arg Leu Thr Trp LyB Pro 
85 90 95 

Glu Glu Phe A^p Asn Met Lys Lys Val Arg Leu Pro Ser Lye Hie He 
100 105 110 

Trp Leu Pro Asp Val Val Leu Tyr Asn Asn Ala Asp Gly Met Tyr Glu 
115 120 125 

Val Ser Phe Tyr Ser Asn Ala Val Val Ser Tyr Asp Gly Ser He Phe 
130 135 140 

Trp Leu Pro Pro Ala He Tyr Lys Ser Ala Cye Lys He Glu Val Lys 
145 150 155 160 

His Phe Pro Phe Asp Gin Gin Asn Cys Thr Met Lys Phe Arg Ser Trp 
165 170 175 

Thr Tyr Asp Arg Thr Glu He Asp Leu Val Leu Lys Ser Glu Val Ala 
180 105 1^0 

Ser Leu Asp Asp Phe Thr Pro Ser Gly Glu Trp Asp He Val Ala Leu 
195 200 205 

Pro Gly Arg Arg Asn Glu Asn Pro Asp Asp Ser Thr Tyr Val Asp He 
210 215 220 

Thr Tyr Asp Phe He He Arg Arg Lys Pro Leu Phe Tyr Thr He Asn 
225 230 235 240 

Leu He He Pro Cys Val Leu He Thr Ser Leu Ala He Leu Val Phe 
245 250 255 

Tyr Leu Pro Ser Asp Cys Gly Glu Lys Met Thr Leu Cys He Ser Val 
260 26S 270 

Leu Leu Ala Leu Thr Val Phe Leu Leu Leu He Ser Lys He Val Pro 
275 230 265 

Pro Thr Ser Leu Asp Val Pro Leu Val Gly Lys Tyr Leu Met Phe Thr 
290 295 300 

Met Val Leu Val Thr Phe Ser He Val Thr Ser Val Cys Val Leu Asn 
305 310 315 320 

Val His His Arg Ser Pro Thr Thr His Thr Met Ala Pro Trp Val Lys 
325 330 335 

Val Val Phe Leu Glu Lys Leu Pro Ala Leu Leu Phe Met Gin Gin Pro 
340 345 350 

Arg His His Cys Ala Arg Gin Arg Leu Arg Leu Arg Arg Arg Gin Arg 
355 360 365 

Glu Arg Glu Gly Ala Gly Ala Leu Phe Phe Arg Glu Ala Pro Gly Ala 
370 375 3B0 

Asp Ser Cys Thr Cys Phe Val Asn Arg Ala Ser Val Gin Gly Leu Ala 

385 390 395 400 



Gly Ala Phe Gly Ala Glu Pro Ala Pro Val Ala Gly Pro Gly Arg Ser 
405 410 415 



• 



Gly Glu Pro Cys Gly Cys Gly Leu Arg Glu Ala Val Asp Gly Val Arg 
420 425 430 

Phe lie Ala Asp His Met Arg Ser Glu Asp Asp Asp Glrv Ser Val Ser 
435 440 445 

Glu Asp Trp Lye Tyr Val Ala Met Val lie Asp Arg Leu Phe Leu Trp 
450 455 460 

He Phe Val Phe Val Cys Val Phe Gly Thr He Gly Met Phe Leu Gin 
465 470 475 4B0 

Pro Leu Phe Gin Asn Tyr Thr Thr Thr Thr Phe Leu His Ser Asp His 
465 490 495 

Ser Ala Pro Ser Ser Lys 
500 



(2) INFORMATION FOR SEQ ID NQilS: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1927 base pairs 

(B) TYPE: nucleic acid 
rC) STHANDEDi5ESS : both 
(D) TOPOLOGY: both 

(ii) MOLECULE TYPE; cDNA 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 98.-1474 

(D) OTHER INFORMATION: /product= "BETA-3 SUBUNIT" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TCGGAACCCC TGTATTTTCT TTTCAAAACC CCCTTTTCCA GTGGAAATGC TCTGTTGTTA 60 

AAAAGGAAGA AACTGTCTTT CTGAAACTCA CATCACG ATG CTC CCA GAT TTT ATG 115 

Met Leu Pro Asp Phe Met 
1 5 



CTG GTT CTC ATC GTC CTT GGC ATC CCT TCC TCA GCC ACC ACA GGT TTC 163 
Le\i Val Leu He Val Leu Gly lie Pro Ser Ser Ala Thr Thr Gly Phe 
10 15 20 

AAC TCA ATC GCC GAA AAT GAA GAT GCC CTC CTC AGA CAT TTG TTC CAA 211 
Asn Ser lie Ala Glu Asn Glu Asp Ala Leu Leu Arg His Leu Phe Gin 
25 30 35 

GGT TAT CAG AAA TGG GTC CGC CCT GTA TTA CAT TCT AAT GAC ACC ATA 259 
Gly Tyr Gin Lye Trp Val Arg Pro Val Leu His Ser Asn Asp Thr He 
40 45 50 

AAA GTA TAT TTT GGA TTG AAA ATA TCC CAG CTT GTA GAT GTG GAT GAA 307 
Lya Val Tyr Phe Gly Leu Lys He Ser Gin Leu Val Asp Val Asp Glu 
55 60 65 70 

AAG AAT CAG CTG ATG ACA ACC AAT GTG TGG CTC AAA CAG QAA TGG ACA 355 
Lye Asn Gin Leu Met Thr Thr Asn Val Trp Leu Lya Gin Glu Trp Thr 
75 eO 85 

GAC CAC AAG TTA CGC TGG AAT CCT GAT GAT TAT OGT GGG ATC CAT TCC 403 
Asp His Lys Leu Arg Trp Asn Pro Asp Asp Tyr Gly Gly He His Ser 
90 95 100 



ATT AAA GTT CCA TCA GAA TCT CTG TGG CTT CCT GAC ATA GTT CTC TXT 451 
He Lys Val Pro Ser Glu Ser Leu Trp Leu Pro Asp He Val Leu Phe 
105 no 115 

GAA AAT GCT GAC GGC CGC TTC GAA GGC TCC CTG ATG ACC AAG GTC ATC 499 
Glu Agn Ala A3p Gly Arg Phe Glu Gly Ser Leu Met Thr Lys Val He 
120 125 130 

GTG AAA TCA AAC GGA ACT GTT GTC TGG ACC CCT CCC CCC AGC TAG AAA 547 
val Lys Ser Aan Gly Thr Val Val Trp Thr Pro Pro Ala Ser Tyr Lys 
135 140 145 150 

AGC TCC TGC ACC ATG GAC GTC ACQ TTT TTC CCG TTC GAC CGA CAG AAC 595 
Ser Ser Cys Thr Met Asp Val Thr Phe Phe Pro Phe Asp Arg Gin Aen 
155 160 165 

TGC TCC ATG AAG TTT GGA TCC TGG ACT TAT GAT GGC ACC ATG GTT GAC 643 
Cys Ser Met Lys Phe Gly Ser Trp Thr Tyr Asp Gly Thr Met Val Asp 
170 . 175 180 

CTC ATT TTG ATC AAT GAA AAT GTC GAC AGA AAA GAC TTC TTC GAT AAC 691 
Leu He Leu He Ascx Glu Asn Val Asp Arg Lys Asp Phe Phe Asp Asn 
165 190 195 

GGA GAA TGG GAA ATA CTG AAT GCA AAG GGG ATG AAG GGG AAC AGA AGG 733 
Gly Glu Trp GXu He Leu Aen Ala Lys Gly Met Lye Gly Asa Arg Arg 
200 205 210 

GAC GGC QTG TAC TCC TAT CCC TTT ATC ACG TAT TCC TTC GTC CTG AGA 787 
Asp Gly Val Tyr Ser Tyr Pro Phe He Thr Tyr Ser Phe Val Leu Arg 
215 220 225 230 

CGC CTG CCT TTA TTC TAT ACC CTC TTT CTC ATC ATC CCC TGC CTG GGG 835 
Arg Leu Pro Leu Phe TVr Thr Leu Phe Leu He He Pro Cys Leu Gly 
235 240 245 

CTG TCT TTC CTA ACA GTT CTT GTG TTC TAT TTA CCT TC6 GAT GAA GGA 663 
Leu Ser Phe Leu Thr Val Leu Val Phe Tyr Leu Pro Ser Asp Glu Gly 
250 255 260 

GAA AAA CTT TCA TTA TCC ACA TCG GTC TTG GTT TCT CTG ACA GTT TTC 931 
Glu Lys Leu Ser Leu Ser Thr Ser Val Leu Val Ser Leu Thr Val Phe 
26S 270 275 

CTT TTA GTG ATT GAA GAA ATC ATC CCA TCG TCT TCC AAA GTC ATT CCT 979 
Leu Leu Val He Glu Glu He He Pro Ser Ser Ser Lys Val He Pro 
280 2B5 290 

CTC ATT GGA GAG TAC CTG CTG TTC ATC ATG ATT TTT GTG ACC CTG TCC 1027 
Leu He Gly Glu Tyr Leu Leu Phe He Met He Phe Val Thr Leu Ser 
295 300 305 310 

ATC ATT GTT ACC GTG TTT GTC ATT AAC GTT CAC CAC AGA TCT TCT TCC 1075 
He He Val Thr Val Phe Val He Asn Val His ais Arg Ser Ser Ser 
315 320 325 

ACG TAC CAC CCC ATG GCC CCC TGG GTT AAG AGG CTC TTT CTG CAG AAA 1123 
Thr Tyr His Pro Met Ala Pro Trp Val Lys Arg Leu Phe Leu Gin Lys 
330 335 340 

CrX CCA AAA TTA CTT TGC ATG AAA GAT CAT GTG GAT CGC TAC TCA TCC 1171 
Leu Pro Lys Leu Leu Cys Met Lys Asp Bis Val Asp Arg Tyr Ser Ser 
345 350 355 

CCA GAG AAA GAG GAG AGT CAA CCA GTA GTG AAA GGC AAA GTC CTC GAA 1219 
Pro Glu Lye Glu Glu Ser Gin Pro Val Val Lye Gly Lys Val Leu Glu 
360 365 370 



AAA AAG AAA CAG AAA CAG CTT AGT GAT GGA GAA AAA GTT CTA GTT GCT 1267 
Lys Lye Lys Gin Lys Gin Leu Ser Asp Gly Glu Lys Val Leu Val Ala 
375 380 365 390 

TTT TTG GAA AAA GCT GCT GAT TCC ATT AGA TAG ATT TCC AGA CAT QTG 1315 
Phd Leu Glu Lys Ala Ala Asp Sex He Axg Tyr He Ser Arg His Val 
3d5 400 40B 

AAG AAA GAA CAT TTT ATC AGC CAG GTA GTA CAA GAC TGG AAA TTT GTA 1363 
Ly6 Lye Glu His Phe He Ser Gin Val Val Gin Asp Trp Lys Phe Val 
410 415 420 

GCT CAA GTT CTT GAC CGA ATC TTC CTG TGG CTC TTT CTG ATA GTG TCA 1411 
Ala Gin Val Leu Asp Arg He Phe Leu Trp Leu Phe Leu lie Val Ser 
425 430 435 

GCA ACA GGC TCG GTT CTG ATT TTT ACC CCT GCT TTG AAG ATG TGG CTA 1459 
Ala Thr Gly Ser Val Leu He Phe Thr Pro Ala Leu Lys Met Trp Leu 
440 - 445 450 

CAT AGT TAC CAT TAGGAATTTC AAAAGACATA AGTACTAAAT TACACCTTAG 1511 

His Ser Tyr His 

455 



ACCTGACATC TGGCTATCAC ACAGACAGAA 


TCCAAATGCA 


TGTGCTTGTT 


ctacgaaccc 


1571 


CGAATGCGTT GTCTTTGTGG AAATGGAACA 


tctcctcatg 


GGAGAAACTC 


TGGTAAATGT 


1631 


GCTCATTTGT GGTTGCCATG AGAGTGAGCT 


QCTTTTAAAG 


AAAGTGGAGC 


CTCCTCAGAC 


1691 


CCCTGCCTTQ GCTTTCCCAG ACATTCAGGG 


agggatcata 


GGTCCAGGCT 


tgagctcaca 


1751 


TGTGGCCAGA OTGCACAAAA AGCTGTTGCT 


ACTTGGTGGA 


GGAACACCTC 


CTAGAAGCAG 


IBll 


CAGGCCTCGG TGGTGGGGGA GGGGGGATTC 


ACCTGGAATT 


AAGGAAGTCT 


CGGTGXCGAG 


1871 


ctatctgtgt QGGCAGAGCC TGGATCTCCC 


ACCCTGCACT 


GGCCTCCTTG 


GTGCCG 


1927 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCS CHARACTERISTICS i 

(A) LSNGTH; 456 amino acids 
CB) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO1I61 

Met Leu Pro Asp Phe Met Leu Val Leu He Val Leu Gly He Pro Ser 
1 5 10 15 

Ser Ala Thr Thr Gly Phe Asn Ser He Ala Glu Aan Glu Asp Ala Leu 
20 25 30 

Leu Arg Hie Leu Phe Gin Gly Tyr Gin Lys Trp Val Arg Pro Val Leu 
35 40 45 

His Ser A»n Asp Thr He Lye Val Tyr Phe Gly Leu Lye He Ser Gin 
50 55 60 

Leu Val Ap Val Asp Glu Lys Asn Gin Leu Met Thr Thr Aen Val Trp 
65 70 75 60 

Leu Lye Gin Glu Trp Thr Asp His Lys Leu Arg Trp Asn Pro Asp Asp 
B5 90 95 



Ala Leu Lys Met Trp Leu His Ser lyr His 
450 455 



(2) INFORMATION K)R SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1915 base pairs 

(B) TVPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY I both 

(ii) MOLECULE TYPE; cDNA 

(ix) FEATURE: 

<A) NAME/KEY: CDS 

(B) LOCATION; 67.. 1583 

(D) OTHER INFORMATION: /prodUCt= "BETA-4 SUBUNIT 
(Xi) SEQUENCE DESCRIPTION i SEQ ID NO: 17: 



CCGGCGCTCA 


CTCGACCGCG 


CGGCTCACGG 


GTGCCCTQTG ACCCCACAGC 


GGAGCTCGCG 


eo 


GCGQCTQCCA 


CCCGGCCCCG 


CCGGCCATGA 


GGCGCGCGCC TTCCCTGGTC 


CTTTTCTTCC 


120 


TGGTCGCCCT 


TTGCGGGCGC 


QGGAACTGCC 


GCGTGGCCAA TGCGGAGGAA 


AAGCTGATGG 


16D 


ACGACCTTCT 


GAACAAAACC 


CGTTACAATA 


ACCTGATCCG CCCAGCCACC 


AGCTCCTCAC 


240 


AGCTCATCTC 


CATCAAGCTG 


CAGCTCTCCC 


TGGCCCAGCT TATCAQCGTG 


AATGAGCGAG 


300 


AGCAGATCAT 


GACCACCAAT 


GTCTGGCTGA 


AACAGGAATG GACTGATTAC 


CGCCTGACCT 


360 


CGAACAGCTC 


CCGCTACGAG 


GGTGTGAACA 


TCCTGAGGAT CCCTGCAAAG 


CGCATCTGGT 


420 


TGCCTGACAT 


C6TGCTTTAC 


AACAACGCCG 


ACGGGACCTA TGAGGTGTCT 


GTCTACACCA 


480 


ACTTGATAGT 


CCGGTCCAAC 


GGCAGCGTCC 


TGTGQCTGCC CCCTGCCATC 


TACAAGAGCG 


540 


CCTGCAAGAT 


TGAGGTGAAG 


TACTTTCCCT 


TCGACCAGCA GAACTGCACC 


CTCAAGTTCC 


600 


CCTCCTGGAC 


CTATGACCAC 


ACGGAGATAG 


ACATGGTCCT CATGACGCCC 


ACAGCCAGCA 


660 


TGGATGACTT 


TACTCCCAGT 


GGTGAGTQGG 


ACATAGTGGC CCTCCCAGGG 


AGAAGGACAG 


720 


TGAACCCACA 


AGACCCCAGC 


TACGTGGACG 


TGACTTACGA CTICATCATC 


AAGCGCAAGC 


780 


CTCTGTTCTA 


CACCATCAAC 


CTCATCATCC 


CCTGCGTGCT CACCACCTTG 


CTGGCCATCC 


B4D 


TCCTCTTCTA 


CCTQCCATCC 


GACTGCGGCG 


AfiAAGATGAC ACTGTGCATC 


TCAGTGCTGC 


900 


TGGCACTGAC 


ATTCTTCCTG 


CTGCTCATCT 


CCAAGATCGT GCCACCCACC 


TCCCTCGATG 


9$0 


TOCCTCTCAT 


CGGCAAGTAC 


CTCATCTTCA 


CCATGGTGCT GGTCACCTTC 


TCCATCGTCA 


1020 


CCAQCGTCTG 


TGTGOTCAAT 


GTGCACCACC 


GCTCCCCCAG CACCCACACC 


ATGGCACCCT 


1080 


GGGTCAAOCG 


CTGCTTCCTO 


CACAAGCTGC 


CTACCTTCCT CTTCATGAAG 


CGCCCTGGCC 


1140 


CCGACAGCAG 


CCCGGCCAGA 


OCCTTCCCGC 


CCAQCAAGTC ATGCGTGACC 


AAGCCCGAGG 


1200 


CCACCGCCAC 


CTCCACCAGC 


CCCTCCAACT 


TCTATGGCAA CTCCATGTAC 


TTTGTGAACC 


12€0 


CCGCCTCTGC 


AGCTTCCAAQ 


TCTCCAGCCG 


GCTCTACCCC 6GTGGCTATC 


CCCAGGGATT 


1320 


TCTGGCTGCG 


GTCCTCTGGG 


AGCTTCCGAC 


AGGATQTGCA GGAGGCATTA 


GAAGGTGTCA 


1360 



CCTTCATCGC CCAGCACATG AAGAATGACG ATGAAGkCCA GAGTGTCGTT GAGGACTGGA 1440 

AGTACGTGGC TATGGTGGTG GACCGGCTGT TCCTGTGGGT GTTCATGTTT GTGTGCGTCC 1500 

TGGGCACT6T GGGGCTCTTC CTCCCOCCCC TCTTCCAGAC CCATGCAGCT TCTGAGGGGC 1560 

CCTACCCTGC CCAGCGTGAC TGAGGGCCCC CTGGGTTGTG GGGTGAGAGG ATGTGAGTGG 1620 

CCGGGTGGGC ACTTTGCTGC TTCTTTCTGG GTTGTGGCCG ATGAGGCCCT AAGTAAATAT 1680 

GTGAGCATTG GCCATCAACC CCATCAAACC AGCCACAQCC GTGGAACAGG CAAGGATGGG 1740 

GGCCTGGCCT GTCCTCTCTG AATGCCTTGG AGGGATCCCA GGAAGCCCCA GXAGGAGGGA IBOO 

GCTTCAQACA GTTCAATTCT GGCCTGTCTT CCTTCCCTGC ACCGGGCAAT GGGGATAAAG 1660 

ATGACTTCGT AGCAGCACCT ACTATGCTTC AGGCATGCTG CCCCCCTGCC TCTCC 1915 
(2) INFORMATION FOR SEQ ID NOtlS: 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 498 amino acidB 

(B) TYPE: amino acid 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPEi protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Arg Arg Ala Pro Ser Leu Val Leu Phe Phe Leu Val Ala Leu Cys 
15 10 15 

Gly Arg Gly Asn Cys Arg Val Ala Asn Ala Glu Glu Lys Leu Met Asp 
20 25 30 

Aep Leu Leu Asn Lys Thr Arg Tyr Asn Aen Leu lie Arg Pro Ala Thr 
35 40 45 

Ser Ser Ser Gin Leu lie Ser He Lys Leu Gin Leu Ser Leu Ala Gin 
50 55 60 

Leu He Ser Val Asn Glu Arg Glu Gin He Met Thr Thr Asn Val Trp 
65 70 75 80 

Leu Lye Gin Glu Trp Thr Asp Tyr Arg Leu Thr Trp Asn Ser Ser Arg 
85 90 95 

Tyr Glu Gly Val A$n He Leu Arg He Pro Ala Lys Arg He Trp Leu 
100 105 110 

Pro Asp He Val Leu Tyr Asn Asn Ala Asp Gly Thr Tyr Glu Val Ser 
115 120 125 

Val Tyr Thr Asn Leu He Val Arg Ser Asn Gly Ser Val Leu Trp Leu 
130 135 140 

Pro Pro Ala He Tyr Lys Ser Ala Cys Lys He Glu VaJ. Lys Tyr Phe 
145 150 155 160 

Pro Phe Asp Gin Gin Aen Cys Thr Leu Lys Phe Arg Ser Trp Thr Tyr 
165 170 175 

Asp His Thr Glu He Asp Met Val Leu Met Thr Pro Thr Ala Ser Met 
180 185 190 

Asp Asp Phe Thr Pro Ser Gly Glu Trp Asp He Val Ala Leu Pro Gly 
195 200 205 



• 



Arg Arg Thr Val Asa Pro Gin Asp Pro Ser Tyr Val Asp Val Thr Tyr 
210 215 220 

Asp Phe II lie Lys tog Lye Pro Leu Phe Tyr Thr lie Aan Leu lie 
225 230 235 240 

lie Pro Cys Val Leu Thr Thr Leu Leu Ala He leu Val Phe Tyr Leu 
245 250 255 

Pro Ser Asp Cys Gly Glu Lye Met Thr Leu Cys He Ser Val Leu Leu 
260 265 270 

Ala Leu Thr Phe Phe Leu Leu Leu He Ser Lye He Val Pro Pro Thr 
275 280 265 

Ser Leu Asp Val Pro Leu He Gly Lys Tyr Leu Met Phe Thr Met Val 
290 295 300 

Leu Val Thr Phe Ser He Val Thr Ser Val Cys Val Leu Abu Val His 
305 310 315 320 

His Arg Ser Pro Ser Thr His Thr Met Ala Pro Trp Val Lya Arg Cys 
325 330 335 

Phe Leu His Lve Leu Pro Thr Phe Leu Phe Met Lys Arg Pro Gly Pro 
340 345 350 

Asp Ser Ser Pro Ala Arg Ala Phe Pro Pro Ser Lys Ser Cys Val Thr 
355 360 365 

Lys Pro Glu Ala Thr Ala Thr Ser Thr Ser Pro Ser Asn Phe Tyr Gly 
370 375 380 

Aan Ser Met Tyr Phe Val Asn Pro Ala Ser Ala Ala Ser Lya Ser Pro 
365 390 395 400 

Ala Gly Ser Thr Pro Val Ala He Pro Arg Asp Phe Trp Leu Arg Ser 
405 410 415 

Ser Gly Arg Phe Arg Gin Asp Val Gin Glu Ala Leu Glu Gly Val Ser 
420 42S 430 

Phe He Ala Gin His Met Lys Asn Asp Asp Glu Asp Gin Ser Val Val 
435 440 445 

Glu Asp Trp Lys Tyr Val Ala Met Val Val Asp Arg Leu Phe Leu Trp 
450 455 460 

Val Phe Met Phe Val Cys Val Leu Gly Thr Val Gly Leu Phe Leu Pro 
465 470 475 480 

Pro Leu Phe Gin Thr His Ala Ala Ser Glu Gly Pro Tyr Ala Ala Gin 
485 490 495 



Arg Asp 
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